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ABSTRACT 

We present a weak lensing mass reconstruction of the interacting cluster 1E0657— 558 in which we 
detect both the main cluster and a sub-cluster. The sub-cluster is identified as a smaller cluster which 
has just undergone initial in-fall and pass-through of the primary cluster, and has been previously 
identified in both optical surveys and X-ray studies. The X-ray gas has been separated from the galax- 
ies by ram-pressure stripping during the pass-through. The detected mass peak is located between 
the X-ray peak and galaxy concentration, although the position is consistent with the galaxy centroid 
within the errors of the mass reconstruction. We find that the mass peak for the main cluster is in 
good spatial agreement with the cluster galaxies and offset from the X-ray halo at 3.4cr significance, 
and determine that the mass-to-light ratios of the two components are consistent with those of relaxed 
clusters. The observed offsets of the lensing mass peaks from the peaks of the dominant visible mass 
component (the X-ray gas) directly demonstrate the presence, and dominance, of dark matter in this 
cluster. This proof of the dark matter existence holds true even under the assumption of modified 
Newtonian gravity (MOND); from the observed gravitational shear to optical light ratios and mass 
peak - X-ray gas offsets, the dark matter component in a MOND regime has a total mass which is at 
least equal to the baryonic mass of the system. 

Subject headings: Gravitational lensing - Galaxies: clusters: individual: 1E0657-558 - dark matter 



1. INTRODUCTION 

It has been long established that the velocity disper- 
sions and X-ray gas temperatures of clusters of galaxies 
are too high to be explained solely by the amount of vis- 
ible matter in the clusters using a physical model with 
Newtonian gravity and general relativity. This obser- 
vation led to the introduction of a "dark matter" com- 
ponent of the mass which interacts with normal matter 
and light only via gravity. Recent observations of clusters 
suggest that the mass is made of ~ 1% of baryons ob- 
servable in optical and infrar ed data, ~ 11% o f baryons 
observable in X-ray data (e.g. lAllen et aljf2002l) . and the 
remaining ~ 88% in the dark matter component. 

An alternative explanation has been that the grav- 
itational force only follows the Newtonian r~ 2 law at 
the level of the force observed in the solar system, and 
that at smaller values the decline with distance is less 
l|Milgromlll98l . This idea of modified gravity (MOND) 
has been used to reproduce the observed rotation ve- 
locities of s piral galaxies without inc lusion of any dark 
matter (e.e. iMcGaugh fc de Bloklll998|) . and could also 
explain the observed velocity dispersions and X-ray tem- 
peratures of clusters without needing any additional mat- 
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ter b eyond the obse rved baryons and a small neutrino 
mass l|Sandersll200ffi . Several arguments against MOND 
have been made using gr avitational lens i ng observations 
of ga laxies and clusters llGavazzil 120021: iHoekstra et alJ 
2002; IMortlock fc TurnerlkoOllT but because of the lack 
of a general relativistic theory giving the strength of 
the interaction between light and gravity in the MOND 
regime these observations can be explained by alterati ons 
in the MOND formalism l|Sanders fc McGaughll200l . 

A definitive test of MOND, however, can be made with 
interacting clusters of galaxies. In the standard CDM 
paradigm, during the initial pass-through the dark mat- 
ter particles and the galaxies are effectively collisionless 
while the X-ray halo is affected by ram pressure. As 
a result, one expects the galaxies and dark matter halo 
to remain spatially coincident following the interaction, 
while the X-ray halo is displaced toward the center o f 
mass of the combined system (e.g. iTormen et al.l feOOS') . 
In the CDM paradigm the mass of the X-ray halo is a 
small component of the total mass, and therefore the 
mass maps created from weak lensing should have the 
primary mass peaks in good spatial agreement with the 
galaxies. In a MOND regime, however, the X-ray gas is 
the dominant component of the total mass. The weak 
lensing mass reconstruction would therefore detect a pri- 
mary mass peak coincident with the gas, which is spa- 
tially offset from the galaxy distribution. 

The z — 0.296 interacting cluster 1E0657-558 provides 
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the ideal c ase in which t o test this theory. First dis- 
covered bv iTucker et alJ l|1995lL subsequent analysis of 
ROSAT HRI data revealed t hat the system is co mprised 
of two merging sub-clusters l|Tucker et alJll998|) . More 
recent Chandra and spectroscopic observations further 
indicate that this mer ger is nearly in the plane of the sky 
ijBarrena et al.ll2002t IMarkevitch et alJ 120021. with the 
lower mass sub-cl uster having recently ex ited the core of 
the main cluster ijMarkevitch et al.ll2002fl. 

The Chandra observatio ns bv lMarkevitch et al.l {2002) ; 
IMarkevitch et alJ l)2003b|) have been particularly valu- 
able in elucidating the dynamical state and geometry of 
this unique system. These data reveal the presence of a 
prominent bow shock leading the lower mass sub-cluster 
(T ~ 6 keV) , which is exiting the core of the main clus- 
ter (T ~ 14 keV) with a relative velocity of 4500 km 
s _1 , determined from the gas density jump at the bow 
shock. Coupled with the current 0.66 Mpc separation be- 
tween the two components, this velocity requires that the 
closest approach of the two components occurred 0.1-0.2 
Gyrs ago. The merger is constrained to be nearly in the 
plane of the sky by the sharpness of the shock front, a 
result consistent with the small line-of-sight component 
of the s ub-cluster veloc i ty der ived from the spectroscopic 
data bv iBarrena et al] ( 2002). Finally, a comparison of 
the Chandra data with optical imaging reveals that the 
X-ray gas associated with the bullet trails the galaxy 
distribution. This latter result, coupled with the sim- 
ple geometry of the system, enables the definitive test 
of MO ND that we descr i be belo w. In an accompanying 
paper. IMarkevitch et al.l l)2003a) utilize the combination 
of Chandra and weak lensing data to also constrain the 
collisional cross-section for self-interacting dark matter. 

This is not the only system which is known to have a 
spatial offset between the galaxies and X-ray gas in a sub- 
component. The high-redshift cluster MS1054-0321 has 
a double-peaked X-ray halo in which the western peak is 
offset from the nearby galaxy overdensity l|Jeltema et al.l 
2001). Unlike 1E0657-558, however, no shock front is ob- 
served in the X-ray data, and as a result the relative ve- 
locities and geometry of the merging components are un- 
known. Further, while a weak lensing ma ss peak has been 
measured near the gal axy overdensity l)Hoekstra et al.l 
1200ft Iclowe et al]l2000|) . the un certainty in the posit ion 
of this mass peak is quite large ({Marshall et al.l l20021 . 

In this paper we use B and / images taken with the 
FORS1 instrument in direct imaging mode on the VLT1 
8-m telescope during 1998 and 2000, which were obtained 
fr om the ESQ archive. These include the images used 
in IBarrena et alJ <|2002f) . but we have independently cre- 
ated the final images from the raw data. In section 2 we 
present the weak lensing analysis of the image and dis- 
cuss the significance and uncertainties in the positions of 
the detected mass peaks. We analyze the photometry in 
section 3 and give mass-to-light ratios for the detected 
mass peaks. Discussion of the results and our conclu- 
sions are presented in section 4. Throughout this paper 
we assume an Q m = 0.3, Qa = 0.7, H = 70 km/s/Mpc 
universe unless stated otherwise. 

2. WEAK LENSING ANALYSIS 

Weak gravitational lensing is a method which can be 
used to measure the surface mass in a region by utilizing 
the fact that the path of a light bundle passing a grav- 



itational potential will be bent by the potential. As a 
result, images of background galaxies which are near a 
massive structure, such as a cluster of galaxies, are de- 
flected away from the structure, enlarged while preserv- 
ing the surface brightness, and distorted such that they 
are stretched tangentially to the center of the potential. 
This third effect, known as gravitational shear (7), causes 
the background galaxies' ellipticities to deviate from an 
isotropic distribution, and the magnitude and direction 
of these deviations is used to measure the mass of the 
structure(s) causing the lensing. This technique of mea- 
suring the mass does not make any assumptions about 
the dynamical state of the mass, and is therefore one of 
the few methods which can be used to measure the mass 
of a dynamically disturbed system. 

The first step in the weak lensing analysis is to de- 
tect background galaxies, measure their ellipticities, and 
correct the ellipticities for smearing due to the point 
spread function. We us ed the prescription given in 
iClowe fc Schneider! l)2002|) for performing this step, in 
which the objects are detected and have their photome - 
try measured using SExtractor l)Bertin &: Arnoutal 996). 
their shapes measured using the IMCAT software pack- 
age (http://www.ifa.hawaii.edu/kaiser/imcat), and the 
PSF sme aring correction performed using the KSB tech- 
nique l|Kaiser et alJll995^ . Background galaxies were se- 
lected using the criteria of having / > 20, B — I < 3.2, 
having a detection significance in I greater than 11, and 
having a 50% encircled light radius larger than that of 
stars. This selection resulted in a catalog with a density 
of 12.3 galaxies/ arcmin 2 over a box 6'.7 on a side, which 
is complete, as measured by the departure of the number 
counts from a power law, to / ~ 24.5, with the faintest 
galaxy having / = 25.97. 

The next step in weak lensing analysis is to convert the 
measured shear into a measurement for the convergence 
k, which is related to the surface density of the lens E 
via 



■^crit 

where £ C rit is a scaling factor: 

c 2 D s 

where D s is the angular distance to the source (back- 
ground) galaxy, D\ is the angular distance to the lens 
(cluster), and D\ s is the angular distance from the lens 
to the source galaxy. Using the same magnitude and 
color selec tions on the HDF-S p hotometric redshift cat- 
alog from iFontana et all l|1999D as were used to create 
the background galaxy catalog results in a mean lensing 
redshift of z hg = 0.85, and a E cri t = 3.1 x 10 9 M Q /kpc 2 . 

Shown in Fig. ^ in solid dark contours is a map of 
k for this field created by using the KS93 algorithm 
ijKaiser fc Sauiresl 11993(1 which uses the fact that both 
the shear and the convergence are combinations of vari- 
ous second derivatives of the surface potential, and there- 
fore the Fourier transform of the shear can be converted 
into the Fourier transform of k by the multiplication of 
the appropriate wave numbers. Because we are recon- 
structing a small field around a massive cluster, however, 
we actually measure the reduced shear g = j/(1—k) from 
the background galaxy ellipticities. Therefore we must 
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Fig. 1. — A) Shown in greyscale is the /-band VLT image used to measure the galaxy shapes for the background galaxies. Overlayed in 
black contours is the weak lensing mass reconstruction with solid contours for positive mass, dashed contours for negative mass, and the 
dash-dotted contour for the zero-mass level, which is set such that the mean mass at the edge of the image is zero. Each contour represents 
a change in the surface mass density of 2.8 X 10 8 JV/Q/kpc 2 . B) Shown in greyscale is the Chandra X-ray image from fMarkcvitc h et alj 
12002) with the same weak lensing contours as in panel A. C) Shown in greyscale is the luminosity distribution of galaxies with the same 
B-I colors as the primary cluster's red sequence. Overlayed are the same mass contours as in panel A. D) Shown in greyscale is the mass 
reconstruction of the field after subtraction of the best-fit King shear profile for the primary cluster. Overlayed in are the same mass 
contours as in panel A. E) Shown in greyscale, with the same color stretch as in panel D, is the mass reconstruction of the field after the 
background galaxies have been rotated by 45 degrees. This provides a good indication of the level of the noise in the reconstruction. The 
contours for the noise are drawn at the same values of K as for the mass reconstruction in panel A. 



perform an iterative solution to the KS93 algorithm in 
which an initial re map is assumed (in this case re = 
everywhere), g corrected with this map to 7, which is 
then transformed to a re map, which is then used in turn 
to correct g, etc ijSeitz fc Schneider! Il995j) . This tech- 
nique typically converges in a few iterations (in this case 
6), and gives an measurement of re in the field relative to 
the level of re at the edge of the image, which is unknown. 

As can be seen in Fig. ^ two distinct mass peaks are 
found in the field, each of which is spatially coincident 
with an overdensity of galaxies. Spectra for galaxies in 
both structures have been published in iBarrena et al.l 
(2002) and the two groups have the same redshift. The 
peaks have significances of 6.4ct for the larger eastern 
peak (hereafter referred to as "the cluster") and 3. Oct for 
the smaller western peak (hereafter referred to as "the 
subclump"). The significances were measured by con- 
volving the mass maps with Mexican-hat filters and com- 
paring the filtered value at the peak position with those 



of randomizations of the mass maps. The randomizations 
were performed by first subtracting a smoothed value of 
the shear (smoothed using a 22'.'4 Gaussian weighted av- 
erage of the surrounding galaxy ellipticities) from the 
galaxy shear estimates to obtain an estimate of the in- 
trinsic cllipticity of the galaxies, then applying a random 
spin to the orientation of each background galaxy while 
preserving their positions and intrinsic ellipticities, and 
creating mass maps from the catalogs. 

An X-ray luminosit y map from Chandra data 
( Mar kevitch et alJ l2002) is overlayed in grey contours in 
Fig-D As can be seen, both peaks are also visible in the 
X-ray data, but are offset in position from both the galax- 
ies and the mass peaks. From the shape, strength, and 
locatio n of the shock visible in t he X-ray peak for the sub- 
clump, [MaAevitc^On l|2002j) have concluded that this 
system has just undergone initial infall and pass-through, 
and the two clusters are now moving away from one an- 
other. The separation between the galaxies, which are 
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Fig. 2. — Shown above is the error map for the centroid of the 
weak lensing mass peak associated with the subclump, generated 
using 10,000 bootstrap resamplings of the background galaxy cata- 
log. The thick black contours indicate the regions containing 68%, 
90%, 95%, and 99% of the centroid positions after smoothing with 
a 2" FWHM Gaussian kernel. The solid circle shows the position 
of the X-ray peak associated with the subclump and the dash con- 
tour shows the boundary of the gas associated with the subclump. 
The solid star shows the centroid of the galaxies in the subclump 
with the encompassing circle showing the ler error contour of the 
centroid. 



effectively collisionless particles in such a pass-through 
event, and the X-ray gas is a result of the ram pressure 
of the interacting gas halos slowing down the X-ray halos 
during the interaction. As a result, a separation between 
the mass peak and the X-ray peak and an agreement in 
positions between the mass peak and galaxy overdensity 
would suggest that the dark matter component of the 
cluster must be relatively collisionless, as compared to 
the X-ray emitting baryonic gas. 

In order to place limits on the collisional cross-section 
of dark matter from the displacement of the mass peak 
from the X-ray peak, we calculated the error on the cen- 
troid determination of the subclump by performing mass 
reconstructions on 10,000 bootstrap-resampled catalogs 
of the background galaxies. For each reconstruction the 
k map was convolved with a Mexican-hat filter to de- 
tect the nearest peak to the position of the subclump 
and measure its significance, imposing a minimum sig- 
nificance of la. The resulting distribution of positions is 
plotted in Fig. [2 As approximately 2.5% of the recon- 
structions should have the mass peak associated with the 
subclump at less than la significance, we eliminated the 
250 most distant peaks from the position of the subclump 
in the data. The remaining peaks have a rms positional 
offset of 12'/1. The separation between the mass peak and 
X-ray peak in the data is 22'.'6, which is significant at a 
1.9a confidence level. This 1-D error analysis, however, 
is at some level incorrect as the distribution of the peak 
positions is not a circular Gaussian and the resampled 
peak distribution has a larger rms errors in right ascen- 
sion than in declination. To measure the significance in 
the 2-D peak position distribution we binned the data 
into 1" x 1" bins and drew contours of decreasing num- 
ber of peaks until the contour intersected the position of 
the X-ray peak. Located inside this contour were 95.5% 
of the resampled peaks. We discuss constraints that this 



system gives on the co llisional cross-section of d ark mat- 
ter in a related paper ijMarkevitch et al.ll2003a|h 

The X-ray gas of the cluster is also offset from the clus- 
ter galaxies and associated dark matter peak. The dark 
matter peak is in good spatial agreement with the clus- 
ter galaxies, and the difference in the shape of the dark 
matter peak relative to the galaxy luminosity distribu- 
tion seen in Fig. ^ is consistent wit h being caused b y 
the noise in the mass reconstruction C Cl owe et al. 1 12000 1. 
Using the same bootstrap-resampled catalogs described 
above and looking for the nearest peak to the position 
of the cluster gives the significance of the offset between 
the X-ray gas and dark matter to be ~ 3.4er. The offset 
gas, however, is a combination of the gas from the cluster 
and gas stripped from the outskirts of the subclump, and 
therefore requires more complicated physics to interpret. 

Because the KS93 mass reconstruction can only mea- 
sure the mass relative to the mean mass at the edge of 
the field and that the images are smaller than the ex- 
pected dynamical size of the cluster (6.'7 = 1770kpc), 
one cannot measure the mass of the cluster reliably with 
the mass reconstruction in Fig.^ Instead, we have mea- 
sured the mass of the cluster using radial shear profile 
fitting in which one assumes a surface mass model for 
the cluster, converts this into a n profile, and then into a 
profile for the reduced shear which is compared to the az- 
imuthally averaged shear profile from the data, as shown 
in Fig. |31 We tried fitting a singul ar isothermal sphere, a 
NFW model ijNavarro et al.lll99fih , and a King model to 
the data and found that the King model was marginally 
preferred over the NFW model, as measured by the 5x 2 
between the mod el reduced shear profile and the data. 
Using an F-test l)Bevington fc Robinson! IT992T) to com- 
pare the 1-parameter SIS to the 2-parameter NFW and 
King models resulted in both the NFW and King mod- 
els being preferred to an SIS at 91% confidence. We 
excluded a 1' diameter region around the subclump from 
the shear profile in order to minimize any contamination 
of the profile from the subclump. Even with this ex- 
cluded area, however, the fit will still include the mass 
of the subclump in the total mass of the cluster for radii 
larger than the subclump-cluster separation which will 
have the effects of overestimating the total mass of just 
the cluster itself as well as underestimating the concen- 
tration of the cluster. It should also be noted that at 
small smoothing lengths, the mass reconstruction of the 
field shows two mass peaks for the main cluster, and thus 
the fact that the King core-model profile is the preferred 
mass profile may be due to the blending of two peaks in 
the radial profile rather than a core in a single peak. 

The King model has a mass density profile 

PO) = z P " 2l 3 (3) 
which integrates to have a surface density profile 



where po, the central mass density, and r c , the core ra- 
dius, are the fitting parameters, r is the 3-D radius, and 
X is the 2-D projecte d radius. The integr ated NFW pro- 
file can be found in iBartelmannl (§1996) . The best-fit 
parameters were po = 3.85 x 10 6 M Q /kpc 3 , r c = 214kpc 
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Fig. 3. — Plotted above is the reduced shear profile for the main 
cluster in the 1E0657— 558 system. Also shown are the reduced 
shear profiles for the best-fit NFW (solid line), King (dashed line), 
and SIS (dash-dotted line) models. 
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Fig. 4. — Plotted above is the minimum surface mass profile for 
the subclump generated using aperture densitometry centered on 
the centroid of the galaxy distribution. The solid line is the mass 
profile when centered on the mass peak in the 2-D mass reconstruc- 
tion. The error-bars for aperture densitometry are correlated such 
that every data point has knowledge of the values of the points at 
larger radius. 



has been effectively removed from the lensing signal, as 
can be seen in Fig. ^ We fitted the three mass model 
profiles to the subclump shear profile, and while the King 
model was the preferred model, its x 2 indicated that it 
was not a good fit to the data. Instead, we have mea- 
sured th e mass of the subclump using aperture densit- 
ometry (jClowe et all 120001 iFahlman et alJll994|) . which 
measures the mean surface density inside a cylinder of a 
given radius minus the mean surface density in an annu- 
lar region. 

Using a 150 kpc radius for the disk and an annular 
region with radial extent of 640 kpc to 706 kpc resulted 
in a mass measurement of 7.3 ± 2.1 x 1O 13 M0 when cen- 
tered on the middle of the subclump mass peak seen in 
Fig-Hand 6.6 ± 1.9 x 10 13 Mq when centered on the cen- 
troid of the red subclump galaxies. The choice of a 150 
kpc radius for the mass measurement was made for two 
reasons: First, the mass profile for the subclump, shown 
in Fig. shows evidence for a plateau in mass associ- 
ated with the subclump between 150 and 250 kpc. This 
might indicate that the dark matter at larger radii has 
been tidally stripped from the core during the interac- 
tion. Secondly, the noise level in the aperture densit- 
ometry increases dramatically with radius, so that the 
signal-to-noise of the mass measurement goes from more 
than 3cr at lOOkpc < r < 150kpc to less than 1.5<t at 
r > 300kpc. 

The observed shear and derived mass of the subclump 
are significantly higher than could be produced by an 
isothermal sphe re with a 212 km/s velocity dispersion, 
as measured bv lBarrena et al.l ((2002). This velocity dis- 
persion, however, is measured from only 7 galaxies and 
could be biased low by their method for distinguishing 
a cluster galaxy from a subclump galaxy. Further, the 
conversion of a velocity dispersion to mass measurement 
requires the assumption of virial equilibrium, which is 
unlikely to apply to the subclump. The weak lensing 
mass is in good agreement with the 7 keV X-ray temper- 
ature for the cold gas blob, under the assumption that 
this was the temperature of the subclump prior to the 
interaction. 



for the King model and r2oo = 2250kpc, c = 3.0 for 
the NFW profile. Both models have the two param- 
eters degenerate in the fits with poor constraints on 
both c and r c . The significances of the fits, as mea- 
sured by the 5\ 2 between the model fit and a zero mass 
model fit, is 6.48 for the King model and 6.37 for the 
NFW model. The King model has surface mass mea- 
surements of 9.5 ± 1.5 x 10 13 M Q , 2.0 ± 0.3 x 10 14 M Q , 
and 4.4 ± 0.7 x 10 14 M Q for 150, 250, and 500 kpc re- 
spectively. The NFW model has surface mass measure- 
ments of 1.02 ±0.16 x 1O 14 M , 2.1 ±0.3 x 10 14 M Q , and 
5.3 ± 0.8 x 10 14 Mq for the same radii. These masses 
are in good agreement wit h the velocity dispers ion for 
early-type galaxies given bv lBarrena et al.l (J2002). 

In order to measure the mass of the subclump, we first 
had to remove the mass of the main cluster which would 
otherwise provide a large positive bias to the mass mea- 
surements. This was accomplished by subtracting the 
reduced shear profile of the best-fit King model for the 
cluster from the background galaxy catalog. A mass re- 
construction of this catalog shows that the main cluster 



3. PHOTOMETRY 
3.1. Centroid of the Galaxy Distribution 

For the standard CDM paradigm we expect the dark 
matter distribution to be coincident with the galaxy dis- 
tribution if the dark matter particles are collisionless. To 
test this expectation we employ adaptive kernel smooth- 
ing to determine the centroid of the galaxy distribution 
associated with the lower ma ss sub-cluster. The me thod 
is similar to that described in lGonzalez et all l)2002t) . An 
Epanechnikov kernel with h = 30" is used for the adap- 
tive smoothing and the highest density peak within a 
50" radius is identified as the sub-cluster. We restrict 
the input catalog to objects with m/=18-25, B — I color 
within 0.5 magnitudes of the brightest cluster galaxy 
(B — I = 3.9), and SExtractor stellarity indices less than 
0.5 in both bands. We compute the number-weighted 
and flux- weighted ccntroids, finding that both weight- 
ing schemes yield indistinguishable results. The uncer- 
tainty is calculated by recomputing the centroid for 1,000 
bootstrap-resampled catalogs. 

We find that the peak of the galaxy distribution is co- 
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incident with the location of the brightest cluster galaxy. 
The number-weighted centroid is 06 : 58; 15.66, —55 : 56 : 
35.3 with a 1.9" la Gaussian uncertainty, while the flux- 
weighted centroid is 06 : 58 : 15.75, —55 : 56 : 35.3 with 
a 3.0" la uncertainty. The net separation between the 
galaxy and weak lensing centroids is 12"3, which has a 
significance of la in the 1-D error analysis and 70% in 
the 2-D error analysis described in the previous section. 

3.2. Luminosity 

The luminosity is determined using two approaches. 
We first derive the luminosity from the galaxy catalog, 
which is the typical method used in cluster M/L mea- 
surements. We then directly compute the luminosity 
by integrating the total flux within the aperture, which 
places a hard upper limit on the luminosity. For both 
techniqu es we correct the de rived luminosities for ex- 
tinction l)Schlegel et al.lll998|) and apply e + K correc- 
tions to convert to rest-frame passbands. The applied 
corrections are based upon the Bruzual & Chariot evolu- 
tionary models for a passive l y evolving elliptical galaxy 
lIBruzual A. fc CharloHll993l iCharlot et~afl lT996). The 
integrated B— I color is consistent with the prediction for 
passive evolution, indicating that this approximation is 
reasonable. Absolute magnitudes are converted to units 
of solar luminosity using the solar absolut e magnitudes 
given in Allen's Astrophysical Quantities ijAllen fc Coxl 
2000). 

For the catalog approach, we first cull the input pho- 
tometric catalog to minimize contamination from stars 
and foreground galaxies. We exclude all objects that are 
brighter than the BCG, more than 1 magnitude redder 
than the BCG (i.e. B - I < 4.9), or have stellarity 
index >0.8 and mj < 20. Two foreground spirals are 
also removed from the input catalog. The flux from the 
remaining galaxies is then summed, with the resulting 
luminosity shown in Tabled We caution that there are 
two caveats with this approach. First, we lack sufficient 
spatial coverage to employ background subtraction. The 
impact of background contamination is expected to be 
minor because of the large density contrast within our 
physical apertures, but such contamination will yield a 
positive bias in the derived luminosity. This effect will 
be greatest for the largest apertures. Second, incomplete- 
ness at the faint end of the luminosity function yields a 
negative bias in the derived luminosity. Because our data 
is complete to roughly 5 (6) magnitudes below L* in / 
(B), the magnitude of this effect should be 4.5 (2.5)% for 
a faint end slope a = —1.3. 

Directly integrating the flux within the apertures pro- 
vides a useful cross-check on the above technique. For 
this approach we only mask stars with stellarity index 
>0.8 and mj < 18. We then integrate the flux within 
the aperture, using two additional apertures located 2' 
north and south of the cluster to quantify the background 
sky level. The sky level in these apertures were com- 
puted after masking detected objects in the region, and 
thus the detected background level does not include flux 
associated with resolved galaxies (except that scattered 
onto the extreme wings of the PSF). Background galaxy 
contamination is expected to yield a slight positive bias 
in the derived luminosity, as with the catalog approach. 
The results are shown in Tabled The luminosities de- 
rived via this method typically agree the catalog results 



to within 15%. 

3.3. Mass-to-Light Ratios 

We determine the mass-to-light ratios of both the main 
cluster and the sub-cluster in rest-frame B and /. If the 
sub-cluster has suffered significant mass loss during pas- 
sage through the core of the main cluster, this should be 
reflected by a decreased mass-to-light ratio. We find no 
evidence for such mass loss, with the M/L ratios for both 
components consistent with one another to within the la 
uncertainties. This result implies, under the assumption 
that the initial M/L ratio for the two structures were 
similar, that the dark matter interaction cross-section 
must be small, a topic that i s explored in greater de - 
tail in an accompanying paper ijMarkevitch et al.l2003aj) . 
We further note that the derived M/L values are consis- 
tent with o t her re cent lensing-derived mass-to-light ra- 
tios. IDahld (|200(y for instance finds M/L B = 259 ± 12 
for a sample of 40 low-redshi ft cluste r s. Th e low disper- 
sion of the M/L ratios in the IDahld 1)2000}) sample also 
suggests that the assumption of similar intrinsic M/L 
ratios for the two components in this cluster is not un- 
reasonable. 

For these mass-to-light ratio calculations for the sub- 
clump we use an aperture centered on the centroid of 
the galaxy light distribution, assuming that the offset in 
the mass peak seen in Fig. ^ is a result of noise in the 
shearfield. If instead we use an aperture centered on the 
observed mass peak for the subclump, the mass-to-light 
ratio of the subclump increases by ~ 10%. Also, the 
mass estimates for the subclump were created by sub- 
tracting the mean surface density in a 640 — 760 kpc 
annulus from the mean surface density within the 150 
kpc disc, and since no similar subtraction was performed 
on the light or the cluster mass, the mass-to-light ratio 
of the subclump must be considered a minimum value in 
making comparisons with the main cluster. 

4. DISCUSSION AND CONCLUSIONS 

In a CDM universe, one would expect that the mass 
peaks for the cluster and subclump would agree with the 
centroid of the galaxy distributions, as both galaxies and 
dark matter particles are collisionless in such an interac- 
tion ijTormen et alJl2~003|) . One would also expect that 
the mass-to-light ratios would decrease by ~ 10 — 15% 
as compared to relaxed systems due to the baryonic X- 
ray halo mass being removed from the structures. Such 
a scenario is, within errors, in good agreement with the 
data. 

In a purely baryonic MOND universe the X-ray and 
galaxy centroids would still be separated as the galaxies 
are still collisionless particles in the interaction. How- 
ever, because the X-ray halo is the dominant mass com- 
ponent of the visible baryons in the cluster, in the 
absence of a dark mass component the vast majority, 
~ 85 — 90%, of the mass of the subclump would be with 
the X-ray gas. Thus, any direct method to measure the 
mass of the system would detect a higher mass about 
the stripped X-ray halo than around the galaxies. This 
is not what is observed in this system. In order to quan- 
tify how much these observations disagree with MOND, 
however, we first need to determine a method to measure 
the masses of the clusters in a MOND cosmology. 
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Table 1. Mass-to-Light Ratios 



Region R M L B Li M/L B M/Li 

(kpc) (1O 14 M ) (IO^Lq) (io"r ) 



Integrated Flux Technique 



Sub-cluster 


150 


.66±.19 


2.1 


4.9 


314±90 


135±39 


Main 


150 


.95±.15 


3.5 


7.8 


271±43 


122±19 


Main 


250 


2.0±0.3 


8.5 


15.8 


235±35 


127±19 


Main 


500 


4.4±0.7 


17.4 


32.5 


253±40 


135±22 






Catalog 


Technique 








Sub-cluster 


150 




2.4 


3.8 


275±79 


174±50 


Main 


150 




3.2 


6.3 


297±47 


151±19 


Main 


250 




8.1 


12.9 


247±37 


155±23 


Main 


500 




21.4 


28.8 


206±32 


152±24 



Note. — Due to the different method in which the main cluster and subclump masses were measured, the subclump M/L 
ratios must be considered a lower bound when comparing with the main cluster. 



Unfortunately, because there is not a derivation of 
MOND from general relativity, there is not a definitive 
way to measure a mass with weak lensing from a mea- 
sured shear field. If one assumes that the relation from 
general relativity between the deflection of a photon and 
of a massive particle moving at the speed of light by a 
static gravitational field is unchanged by MOND, then it 
can be shown that the shear field caused by a point mass 
is 



70) = 



e 3 



2(6 



(5) 



where 9 is the distance from the point mass, 6>e is 
the Einstein radius for the lens, 9q is the distance at 
which the gravitational acceleration changes from New- 
tonian to MOND, and 9 out is the distance at which 
the gravitational accele ration changes back to a 6~ 2 
law (jMortlock fc Turnerll20f)ll iHoekstra et alJl2002T) . 



IHoekstra et al]l2002ft . As 

both 9e and 9q scale as the square root of the point mass, 
the resulting shear profile scales linearly with the mass 
for 9 <C 9q and 9 ^> 9 out , as the square root of the mass 
for 9q <C 9 <C # ut, and somewhere between the two ex- 
tremes for the transitional regions 9 ~ 9q and 9 ~ 9 ou t- 

Calculating the expected shear profile for an extended 
source in the MOND regime is complicated by the lack of 
a thin lens approximation, which is used to simplify the 
equat ions with Newtonian gravity (Mortloc k fc Turner! 
12001 . However, it is reasonable to assume that the same 
general relation between the gravitational shear field and 
overall mass of a halo exists as per the point mass rela- 
tion. As such, the level of weak shear produced by a 
cluster of galaxies at a radii of a few hundred kpc from 
the cluster core (which would be between 9q and 9 ou t for 
10 13 to 10 16 M Q clusters) should scale with the mass of 
the cluster, probably somewhere between a linear scale 
and a scale with the square root of the mass. 

From observations, we know that the shear fields 
produced by indivi dual galaxies ijHoekstra et alJ l2003t 
iMcKav et alJ 120021) are an order of magnitude lower 
than those produced by galaxy groups l)Hoekstra et al.1 
1200 which are an order of magnitu de lower tha n 
those produced by poor clusters l|Wittman et al.1 12000'). 
which are in turn signi ficantly lower than those pro- 
duced by rich clusters llpahle et al.1 120021 iClowe et al.1 
MM IClowe fc Schneider! 120011120021) . 



As the amount 



of visible baryons in these structures scale in a similar 
manner, then from these observations we have support 
for the above assumptions. Thus, in a MOND universe, 
one should still observe a change in the shear field of a 
structure with a change in the mass of the structure. 

As a result, if the mass of clusters of galaxies is limited 
solely to visible baryons, then by removing the X-ray halo 
from the cluster one should reduce the gravitational shear 
centered on the galaxies by at least a factor of three, if 
the shear scales as the square root of the mass, and up to 
a factor of ten if the shear scales linearly with the mass. 
In this system, however, we find that the ratio of the 
gravitational shear to visible light for two components 
which have the X-ray halo stripped from the galaxies is 
consistent with that found in normal clusters, which have 
the X-ray halo and galaxies spatially coincident. This is 
inconsistent with the shear scaling as the square root 
of the mass MOND model at roughly a 2a confidence 
level and with the shear scaling linear with the mass at 
roughly a 3er confidence level. 

In order to reduce the inconsistency with the data to 
a ~ 1(7 confidence level, one would need to add a non- 
luminous mass component to the clusters which is equal 
to the mass of luminous matter for the shear scaling as 
square root of mass case, and which is 2.5 times the mass 
of luminous matter for shear scaling linearly with mass 
case. This extra mass component would also reduce the 
problem with the detected mass peak for the subclump 
being closer to the galaxies than the X-ray halo as the 
detected signal would be a blend of the two components 
due to the required smoothing of the mass map. 

The more significant offset between the cluster mass 
peak and X-ray halo would require a greater amount of 
dark mass to explain if the two components were cleanly 
separated. The X-ray halo, however, is extended over 
the cluster galaxies which may indicate some fraction of 
the X-ray gas has already been drawn back to the galaxy 
position. 

Any dark mass component of the system must be rel- 
atively collisionless, so it can undergo the pass-through 
without loss of velocity or mass, and able to clump on 
scales smaller than 100 kpc (the smallest aperture for 
which we can reliably measure the shear about the sub- 
clump). Adopting big-bang nucleosynthesis limits on the 



mean baryonic mass of the universe excludes most of 
this mass from being baryons in cold, condensed struc- 
tures. The dumpiness limit excludes the matter from 
being massive neutrinos with masses less than 4.5 eV 
( Sand ersl2003t iTremaine & Gunnlll979Tl . Since neutrinos 
more massive than 2.2 eV have been ruled out experimen- 
tally ijBonn et al.ll2002t lLobashev et~alll2001(l , neutrinos 
thus cannot explain this mass. 

4.1. Summary 

We have shown above that the cluster 1E0657— 558 has 
a lower-mass subclump visible in X-ray and optical obser- 
vations as well as in a weak lensing mass reconstruction. 
The X-ray and optically luminous components are spa- 
tially separated at high significance, as one would expect 
for a system which has just undergone a n initial infall 
and t ransit of a larger mass system (e.g. iTormen et alJ 
2003). The observed mass peak in the weak lensing re- 
construction lies between X-ray and optical components, 
but is closer to, and consistent with, the optical com- 
ponent. The centroid of the subclump mass peak has a 
fairly large error resulting in the offset of mass peak from 
the centroid of the galaxy distribution having a ~ 70% 
confidence level and the offset of the mass peak from the 
X-ray peak having a ~ 95% confidence level. 

The primary cluster has also been detected in the weak 
lensing mass reconstruction, and has a mass peak which 
is spatially coincident with the cluster galaxies. The X- 
ray gas from the main cluster is offset from the mass peak 
at a 3.4cr significance. 

We have also measured the mass-to-light ratio for the 
subclump at a 150 kpc radius and for the main cluster 
at 150, 250, and 500 kpc radii. We find that the sub- 



clump has a mass-to-light ratio which is consistent with 
the mass-to-light ratio of the main cluster, and that both 
are consistent with mass-to-light ratios for relaxed clus- 
ters. The dominant source of error in the mass-to-light 
ratios and the mass-X-ray gas offsets is the weak lensing 
mass reconstructions, which can be improved by obtain- 
ing shear information on a wider field than the 7' x T 
VLT field and/or by obtaining deeper imaging on the 
same field with a smaller PSF in order to greatly in- 
crease the number density of background galaxies usable 
for the measurement of the shear field. 

Finally, we have argued that even in a MOND uni- 
verse, a significant fraction of the original mass of the 
subclump must exist in the form of dark matter which, 
furthermore, should be non-baryonic and non-neutrino. 
The exact amount of extra mass cannot be calculated 
due to the lack of a MOND derivation from general rela- 
tivity, but phenomenological arguments suggest that it is 
at least equal to the baryonic mass of the cluster. While 
these observations cannot disprove MOND, or alterna- 
tively prove that gravity is Newtonian on small accelera- 
tion scales, they remove its primary motivation of avoid- 
ing the notion of dark matter. 
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